Demographer: Extremely Simple Name Demographics
نویسندگان
چکیده
The lack of demographic information available when conducting passive analysis of social media content can make it difficult to compare results to traditional survey results. We present DEMOGRAPHER,1 a tool that predicts gender from names, using name lists and a classifier with simple character-level features. By relying only on a name, our tool can make predictions even without extensive user-authored content. We compare DEMOGRAPHER to other available tools and discuss differences in performance. In particular, we show that DEMOGRAPHER performs well on Twitter data, making it useful for simple and rapid social media demographic inference.
منابع مشابه
Working Hard for the Money Trends in Women ’ s Employment 1970 to 2007 KrisTin sMiTH rEporT s on rur al aMErica
the neil and louise Tillotson Fund of the new Hampshire charitable Foundation. Family Demographer The carsey institute university of new Hampshire
متن کاملThe curiously misunderstood role of evidence in designing new technology.
" Flying safely has not developed using experimental and control groups of air passengers and counting victims. " —Ülo Kristjuhan 75 R ECENTLY, I TOOK PART in a radio interview together with the demographer Jay Olshansky. Jay and I have been friends for over a decade, and we have done this before: indeed , he and I have for most of that period been among the most frequently appearing academic g...
متن کاملIdentifying Participants in the Personal Genome Project by Name (A Re-identification Experiment)
We linked names and contact information to publicly available profiles in the Personal Genome Project. These profiles contain medical and genomic information, including details about medications, procedures and diseases, and demographic information, such as date of birth, gender, and postal code. By linking demographics to public records such as voter lists, and mining for names hidden in attac...
متن کاملAuthor name disambiguation: What difference does it make in author-based citation analysis?
In this paper, we explore how strongly author name disambiguation (AND) affects the results of an author-based citation analysis study, and identify conditions under which the commonly used simplified approach of using surnames and first initials may suffice in practice. We compare author citation ranking and co-citation mapping results in the stem cell research field 2004-2009 between two AND ...
متن کاملPath ORAM: An Extremely Simple Oblivious RAM Protocol Citation
We present Path ORAM, an extremely simple Oblivious RAM protocol with a small amount of client storage. Partly due to its simplicity, Path ORAM is the most practical ORAM scheme known to date with small client storage. We formally prove that Path ORAM has a O(logN) bandwidth cost for blocks of size B = Ω(logN) bits. For such block sizes, Path ORAM is asymptotically better than the best known OR...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016